Chemical Identification using Bayesian Model Selection
نویسندگان
چکیده
Remote detection and identification of chemicals in a scene is a challenging problem. We introduce an approach that uses some of the image’s pixels to establish the background characteristics while other pixels represent the target for which we seek to identify all chemical species present. This leads to a generalized least squares problem in which we focus on “subset selection” to identify the chemicals thought to be present. Bayesian model selection allows us to approximate the posterior probability that each chemical in the library is present by adding the posterior probabilities of all the subsets which include the chemical. We present results using realistic simulated data for the case with 1 to 5 chemicals present in each target and compare performance to a hybrid of forward and backward stepwise selection procedure using the F statistic. Introduction We consider infrared hyperspectral image data in which we first locate chemical plumes and then characterize the chemical components of the plume (McVey et. al., 2002). Our focus is chemical identification from a library of tens to hundreds of chemicals using a passive infrared (IR) detector several kilometers above the ground. A typical scene is 1000 x 128 pixels, where the detected signal at each pixel depends on the ground radiance, atmospheric transmission, instrument noise, and whether a chemical plume lies between the ground and the detector. A prior analysis identifies the plume pixels (pixels that have a plume influencing the signal) from among the background pixels (pixels that do not have a plume influencing the signal). Simplifying assumptions lead to a general least squares (GLS) problem for which we choose which subset (up to 3 chemicals from approximately 100 chemicals for example) of chemicals is most likely to be present in the plume. The next section presents a physical model and the assumptions to convert this to a GLS problem. Following sections present model selection approaches including Bayesian model averaging (BMA) and “pick the winner” (PW) using penalized likelihood (the F test is a special case). We present results with simulated data, and include cases where the errors do not have a Gaussian distribution and the predictors have nonnegligible measurement error. We conclude that the false positive and negative rates for BMA are similar to those for PW provided the appropriate penalty is used with PW. Background A hyperspectral detector detects photons emitted in the IR region from the ground. The signal from background pixel i is ( ) ( ) ( ) ( ) g b i i j j j i j i S L N ε ν ν τ ν ν = + (1), where ( ) i j ε ν is the emissivity at wavelength (or frequency) j ν , ( ) g j i L ν is the Planck function at ground temperature, ( ) j τ ν is the atmospheric transmission, and ( ) i j N ν is the noise. The signal from plume pixel i is ( )[ ( ) ( ) ( )] ( ) (2), p p g b p j j i j j j i i i i S L L S α ν ν ε ν ν τ ν = − + where ( ) p j α ν is the plume absorption and ( ) p j i L ν is the Planck function at plume temperature. The plume has two effects: it emits in the IR region, but it also absorbs the radiation emitted from itself and from the ground. The ( ) i j ε ν terms (emissitivites) depend on the properties of the background. Concrete, asphalt, buildings, grass, dirt, water, and other common background features each have their characteristic emissitivity. Our synthetic background scenes are generated from various mixtures of approximately 100 typical background emissitivites. Our synthetic plumes are generated from a library of approximately 100 chemical species of interest. Typically there are 10,000 to 16,000 pixels in a scene with 1 to a few plumes. The number of pixels in a plume is 10’s to 100’s. Spring Research Conference Section on Physical & Engineering Sciences (SPES)
منابع مشابه
Project Portfolio Risk Response Selection Using Bayesian Belief Networks
Risk identification, impact assessment, and response planning constitute three building blocks of project risk management. Correspondingly, three types of interactions could be envisioned between risks, between impacts of several risks on a portfolio component, and between several responses. While the interdependency of risks is a well-recognized issue, the other two types of interactions remai...
متن کاملیک مدل بیزی برای استخراج باناظر گرامر زبان طبیعی
In this paper, we show that the problem of grammar induction could be modeled as a combination of several model selection problems. We use the infinite generalization of a Bayesian model of cognition to solve each model selection problem in our grammar induction model. This Bayesian model is capable of solving model selection problems, consistent with human cognition. We also show that using th...
متن کاملInferencing Bayesian Networks from Time Series Data Using Natural Selection
This paper describes a new framework for using natural selection to evolve Bayesian Networks for use in forecasting time series data. It extends current research by introducing a tree based representation of a candidate Bayesian Network that addresses the problem of model identification and training through the use of natural selection. The framework constructs a modified Naïve Bayesian classif...
متن کاملBayesian change point estimation in Poisson-based control charts
Precise identification of the time when a process has changed enables process engineers to search for a potential special cause more effectively. In this paper, we develop change point estimation methods for a Poisson process in a Bayesian framework. We apply Bayesian hierarchical models to formulate the change point where there exists a step < /div> change, a linear trend and a known multip...
متن کاملمقایسه روش های مختلف آماری در انتخاب ژنومی گاوهای هلشتاین
Genomic selection combines statistical methods with genomic data to predict genetic values for complex traits. The accuracy of prediction of genetic values in selected population has a great effect on the success of this selection method. Accuracy of genomic prediction is highly dependent on the statistical model used to estimate marker effects in reference population. Various factors such a...
متن کاملEstimation of genetic parameters of litter size in Moghani sheep using threshold model via Bayesian approach
This study was conducted to estimate the genetic parameters of litter size (LS) in Moghani sheep using threshold model via Bayesian approach. The data originated from the Jafar-Abad Station of Ardabil province, Iran, and included 9698 lactation records of 4977 ewes with lambings from 1995 until 2010. The pedigree file consisted of data on animals born from 1987 to 2010. The significance of fixe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002